Pancreas-derived serpin

ABSTRACT

The present invention provides nucleotide and amino acid sequences that identify and encode a novel pancreas-derived serpin (PDS) expressed in human pancreas. The present invention also provides for antisense molecules to the nucleotide sequences which encode PDS, expression vectors for the production of purified PDS, antibodies capable of binding specifically to PDS, hybridization probes or oligonucleotides for the detection of PDS-encoding nucleotide sequences, genetically engineered host cells for the expression of PDS, diagnostic tests based on PDS-encoding nucleic acid molecules and a pharmaceutical composition containing PDS capable of binding specifically to a serine protease.

FIELD OF THE INVENTION

The present invention is in the field of molecular biology; more particularly, the present invention describes the nucleic acid and amino acid sequences of a novel pancreas-derived serpin.

BACKGROUND OF THE INVENTION

Serpins

Serpins are extracellular, irreversible serine protease inhibitors. As a group, they are defined on the basis of their structural and functional characteristics--high molecular weight, 370-420 amino acid residues, and C-terminal reactive region. Proteins which have been assigned to the serpin family include the following: α-1 protease inhibitor, α-1-antichymotrypsin, antithrombin III, α-2-antiplasmin, heparin cofactor II, complement C1 inhibitor, plasminogen activator inhibitors 1 and 2, glia derived nexin, protein C inhibitor, rat hepatocyte inhibitors, crmA (a viral serpin which inhibits interleukin 1-β cleavage enzyme), human squamous cell carcinoma antigen (which may modulate the host immune response against tumor cells), human maspin (which seems to function as a tumor supressor; Zou Z et al (1994) Science 263:526-529), lepidopterian protease inhibitor, leukocyte elastase inhibitor (the only known intracellular serpin), and three orthopoxviruses (which may be involved in the regulation of the blood clotting cascade and/or of the complement cascade in the mammalian host).

In addition, a number of proteins with no known inhibitory activity are also categorized as serpins on the basis of strong sequence and structural similarities. They include bird ovalbumin, angiotensinogen, barley protein Z, corticosteroid binding globulin, thyroxine binding globulin, sheep uterine milk protein, pig uteroferrin-associated protein, an endoplasmic reticulum heat-shock protein (which binds strongly to collagen and could act as a chaperone), pigment epithelium-derived factor, and an estrogen-regulated protein from Xenopus.

The signature pattern for the serpins is based on a well conserved pro-phe sequence which is located ten to fifteen residues C-terminal to the reactive site loop (RSL). The serpin consensus pattern is LIVMFY!-x- LIVMFYAC!- DNQ!- RKHQS!- PST!-F- LIVMFY! LIVMFYC!-x- LIVMFAH!, and P is found in position 6 of the pattern in most serpins.

Serpins are defined and described in Carrell R and Travis J (1985) Trends Biochem Sci 10:20-24; Carrell R et al (1987) Cold Spring Harbor Symp Quant Biol 52:527-535; Huber R and Carrell R W (1989) Biochemistry 28:8951-8966; and Remold-O'Donneel E (1993) FEBS Lett 315:105-108.

Mode of Action

Protease inhibitors form tight complexes with their target proteases. For instance, small molecule inhibitors such as tetrapeptide keto esters form a covalent bond with the catalytic site of serine proteases and also interact with substrate-binding subsites. For the Kunitz family of protease inhibitors, extended interactions involving the entire substrate binding surface on both sides of the reactive site are utilized.

The region of a serpin which binds to the target protease is an exposed reactive site loop (RSL). In contrast to the above inhibitors, serpins have mobile RSLs. The RSL sequence from P17 to P8 is highly conserved, and small amino acid with side chains are found at positions P9, P10, P11, P12, and P15 in active inhibitors. Sequence divergence in the hinge region is usually associated with conversion of the molecule from an inhibitor to a substrate. In fact, proteolytic cleavage near the reactive site results in profound structural changes. Cleavage of the characteristic serpin P1-P1' bond of a1-proteinase inhibitor results in a separation of about 69 Å between the two residues (Loebermann H et al (1984) J Mol Biol 177:531-556). In addition, the peptide loop from P14-P2 (numbering from the active site P1-P1') is inserted into the middle of the A-sheet. These structural changes are accompanied by pronounced increase in stability to heat- or guanidine-induced denaturation and this change is referred to as the stressed-to-relaxed (S->R) transition. The ability of a serpin to function as an inhibitor may be directly related to its ability to undergo this S->R transition (Bruch M et al (1988) J Biol Chem 263:1662 6-30; Carrell R W et al (1992) Curr Opin Struct Biol 2:438-446). Ovalbumin, a noninhibitor of the serpin family, is unable to undergo this S->R transition.

To determine the role of small amino acids in the hinge region of protease nexin-1, Braxton S M et al (Keystone Symposium, 11 Mar. 1994) replaced glycine at position 331 (P15) with serine, alanine, proline and valine. The G₃₃₁ ->V mutation was nearly inactive, the G₃₃₁ ->P was completely inactive, and replacement of G₃₃₁ with S and A had a smaller effect on inhibition. P12 (A₃₃₄ >V) and P10 (A₃₃₆ ->V) mutations also significantly reduced activity. These mutagenesis experiments indicate that a portion of the RSL, up to at least P10, must incorporate into the A-sheet in order for PN-1 to act as an inhibitor, and mutations which hinder this structural transition cause PN-1 to act as a substrate.

Discovery

The exocrine pancreas produces an abundance of proteolytic enzymes such as trypsin, chymotrypsin, carboxypeptidase and the serine proteases which split whole and partially-digested proteins into polypeptides and smaller moieties. Several elastases and nucleases are also found in the pancreatic juice. Other digestive enzymes produced by the pancreas include pancreatic amylase which digests carbohydrates, and pancreatic lipase, cholesterol esterase, and phospholipase which hydrolyze lipids and fats.

The four molecules which control pancreatic secretion are acetylcholine and the hormones, gastrin, cholecystokinin (CCK), and secretin. Acetylcholine is released from the parasympathetic vagus and other cholinergic nerve endings, gastrin is secreted by cells of the stomach, and CCK and secretin are secreted by the upper small intestine. The gastrointestinal (GI) hormones are absorbed into the blood and transported to the pancreas where they stimulate the secretion of enzymes and of sodium bicarbonate and water (which wash the pancreatic enzymes into the duodenum).

The endocrine pancreas consists of islets of Langerhans, whose cells are separated from the exocrine lobules and are distributed throughout the pancreas. The endocrine cells of the islets secrete hormones which participate in the metabolism of proteins, carbohydrates, and fats.

The major endocrine cells are α, β, and δ cells; the minor cells are C cells, EC cells, and PP cells. About 15% of the islet cell population are a cells which are located along the periphery of islets and secrete the hormone glucagon. β cells comprise about 70% of the islet cell population, are located around the center of the islets, and secrete the hormone insulin. δ cells comprise about 10% of the population, are located close to α cells and secrete two different hormones, somatostatin and vasoactive intestinal peptide (VIP). C, EC, and PP cells make up the final 5% of the islet cell population. Although the function of C cells is unknown, EC and PP cells secrete seratonin and pancreatic polypeptide, respectively.

Inflammation of the pancreas or pancreatitis may be classified as either acute or chronic by clinical criteria. With treatment, acute pancreatitis can often be cured and normal function restored. Chronic pancreatitis often results in permanent damage. The precise mechanisms which trigger acute inflammation are not understood. However, some causes in the order of their importance are alcohol ingestion, biliary tract disease, post-operative trauma, and hereditary pancreatitis. One theory provides that autodigestion, the premature activation of proteolytic enzymes in the pancreas rather than in the duodenum, causes acute pancreatitis. Any number of other factors including endotoxins, exotoxins, viral infections, ischemia, anoxia, and direct trauma may activate the proenzymes. In addition, any internal or external blockage of pancreatic ducts can also cause an accumulation of pancreatic juices in the pancreas resulting cellular damage.

Anatomy, physiology, and diseases of the pancreas are reviewed, inter alia, in Guyton A C (1991) Textbook of Medical Physiology, WB Saunders Co, Philadelphia Pa.; Isselbacher K J et al (1994) Harrison's Principles of Internal Medicine, McGraw-Hill, New York City; Johnson K E (1991) Histology and Cell Biology, Harwal Publishing, Media Pa.; and The Merck Manual of Diagnosis and Therapy (1992) Merck Research Laboratories, Rahway N.J.

SUMMARY OF THE INVENTION

The subject invention provides a unique nucleotide sequence which encodes a novel pancreas-derived serpin, also known as pds. The new gene, which was identified from Incyte Clone 222689, encodes PDS polypeptide, and represents a new human serine protease inhibitor.

The invention also comprises diagnostic tests for physiologically or pathologically compromised pancreas which include the steps of testing a sample or an extract thereof with pds DNA, fragments or oligomers thereof. Further aspects of the invention include the antisense DNA of pds; cloning or expression vectors containing pds; host cells or organisms transformed with expression vectors containing pds; a method for the production and recovery of purified PDS polypeptide from host cells; purified PDS polypeptide; antibodies to PDS, and pharmacological compounds using PDS.

DESCRIPTION OF THE FIGURES

FIGS. 1A and 1B 1C, 1D display the nucleotide sequence (SEQ ID NO: 1) for pds and the predicted amino acid sequence of PDS polypeptide (SEQ ID NO: 2).

FIGS. 2A, 2B and 2C, 2D show the amino acid alignment of PDS with human and rat serpins. Alignments shown were produced using the multisequence alignment program of DNASTAR software (DNASTAR Inc, Madison Wis.).

DETAILED DESCRIPTION OF THE INVENTION

Definitions

As used herein, pancreas-derived serpin refers to an PDS polypeptide, naturally occurring PDS polypeptide, or active fragments thereof, which are encoded by mRNAs transcribed from the cDNA of Seq ID NO: 1.

"Active" refers to those forms of PDS which retain biologic and/or immunologic activities of any naturally occurring PDS.

"Naturally occurring PDS" refers to PDS produced by human cells that have not been genetically engineered and specifically contemplates various PDSs arising from post-translational modifications of the polypeptide including but not limited to acetylation, carboxylation, glycosylation, phosphorylation, lipidation and acylation.

"Derivative" refers to polypeptides derived from naturally occurring PDS by chemical modifications such as ubiquitination, labeling (e.g., with radionuclides, various enzymes, etc.), pegylation (derivatization with polyethylene glycol), or by insertion (or substitution by chemical synthesis) of amino acids (aa) such as ornithine, which do not normally occur in human proteins.

"Recombinant variant" refers to any polypeptide differing from naturally occurring PDS by amino acid insertions, deletions, and substitutions, created using recombinant DNA techniques.

Preferably, amino acid "substitutions" are the result of replacing one aa with another aa having similar structural and/or chemical properties, such as the replacement of a leucine with an isoleucine or valine, an aspartate with a glutamate, or a threonine with a serine, i.e., conservative aa replacements. "Insertions" or "deletions" are typically in the range of about 1 to 5 amino acid. The variation allowed may be experimentally determined by systematically making insertions, deletions, or substitutions of amino acid in an PDS molecule using recombinant DNA techniques and assaying the resulting recombinant variants for activity.

Where desired, a "signal or leader sequence" can direct the polypeptide through the membrane of a cell. Such a sequence may be naturally present on the polypeptides of the present invention or provided from heterologous protein sources by recombinant DNA techniques.

A polypeptide "fragment," "portion," or "segment" is a stretch of amino acid residues of at least about 5 amino acid, often at least about 7 aa, typically at least about 9 to 13 aa, and, in various embodiments, at least about 17 or more amino acid. To be active, any PDS polypeptide must have sufficient length to display biologic and/or immunologic activity on their own or when conjugated to a carrier protein such as keyhole limpet hemocyanin.

An "oligonucleotide" or polynucleotide "fragment", "portion," or "segment" is a stretch of nucleotide residues which is long enough to use in polymerase chain reaction (PCR) or various hybridization procedures to amplify or simply reveal related parts of mRNA or DNA molecules. One or both oligonucleotide probes will comprise sequence that is identical or complementary to a portion of PDS where there is little or no identity or complementarity with any known or prior art molecule. The oligonucleotide probes will generally comprise between about 10 nucleotides and 50 nucleotides, and preferably between about 15 nucleotides and about 30 nucleotides.

"Animal" as used herein may be defined to include human, domestic or agricultural (cats, dogs, cows, sheep, etc) or test species (mouse, rat, rabbit, etc).

The present invention includes purified PDS polypeptides from natural or recombinant sources, cells transformed with recombinant nucleic acid molecules encoding PDS. Various methods for the isolation of the PDS polypeptides may be accomplished by procedures well known in the art. For example, such polypeptides may be purified by immunoaffinity chromatography by employing the antibodies provided by the present invention. Various other methods of protein purification well known in the art include those described in Deutscher M (1990) Methods in Enzymology, Vol 182, Academic Press, San Diego Calif.; and Scopes R (1982) Protein Purification: Principles and Practice. Springer-Verlag, New York City, both incorporated herein by reference.

"Recombinant" may also refer to a polynucleotide which encodes PDS and is prepared using recombinant DNA techniques. The DNAs which encode PDS may also include allelic or recombinant variants and mutants thereof.

"Nucleic acid probes" are prepared based on the cDNA sequences which encode PDS provided by the present invention. Nucleic acid probes comprise portions of the sequence having fewer nucleotides than about 6 kb, usually fewer than about 1 kb. After appropriate testing to eliminate false positives, these probes may be used to determine whether mRNAs encoding PDS are present in a cell or tissue or to isolate similar nucleic acid sequences from chromosomal DNA extracted from such cells or tissues as described by Walsh P S et al (1992, PCR Methods Appl 1:241-250).

Probes may be derived from naturally occurring or recombinant single- or double-stranded nucleic acids or be chemically synthesized. They may be labeled by nick translation, Klenow fill-in reaction, PCR or other methods well known in the art. Probes of the present invention, their preparation and/or labeling are elaborated in Sambrook J et al (1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor N.Y.; or Ausubel F M et al (1989) Current Protocols in Molecular Biology, John Wiley & Sons, New York City, both incorporated herein by reference.

Alternatively, recombinant variants encoding these same or similar polypeptides may be synthesized or selected by making use of the "redundancy" in the genetic code. Various codon substitutions, such as the silent changes which produce various restriction sites, may be introduced to optimize cloning into a plasmid or viral vector or expression in a particular prokaryotic or eukaryotic system. Mutations may also be introduced to modify the properties of the polypeptide, including but not limited to ligand-binding affinities, interchain affinities, or polypeptide degradation or turnover rate. One example involves inserting a stop codon into the nucleotide sequence to limit the size of PDS so as to provide a binding, non-activating ligand of smaller molecular weight which would serve to block the activity of the natural pancreas-derived serpin.

DETAILED DESCRIPTION OF THE INVENTION

The present invention provides a nucleotide sequence identified in Incyte 222689 uniquely identifying a new, pancreas-derived serpin (PDS) of the cysteine protease family which was expressed in pancreatic cells. Because pds is specifically expressed in pancreas, the nucleic acids (pds), polypeptides (PDS) and antibodies to PDS are useful in diagnostic assays for physiologic or pathologic problems of the pancreas. Increased expression of proteases are known to lead to tissue damage or destruction; therefore, a diagnostic test for the presence and expression of PDS can accelerate diagnosis and proper treatment of such problems.

The nucleotide sequence encoding PDS has numerous applications in techniques known to those skilled in the art of molecular biology. These techniques include use as hybridization probes, use in the construction of oligomers for PCR, use for chromosome and gene mapping, use in the recombinant production of PDS, and use in generation of anti-sense DNA or RNA, their chemical analogs and the like. Uses of nucleotides encoding PDS disclosed herein are exemplary of known techniques and are not intended to limit their use in any technique known to a person of ordinary skill in the art. Furthermore, the nucleotide sequences disclosed herein may be used in molecular biology techniques that have not yet been developed, provided the new techniques rely on properties of nucleotide sequences that are currently known, eg, the triplet genetic code, specific base pair interactions, etc.

It will be appreciated by those skilled in the art that as a result of the degeneracy of the genetic code, a multitude of PDS-encoding nucleotide sequences, some bearing minimal homology to the nucleotide sequence of any known and naturally occurring gene may be produced. The invention has specifically contemplated each and every possible variation of nucleotide sequence that could be made by selecting combinations based on possible codon choices. These combinations are made in accordance with the standard triplet genetic code as applied to the nucleotide sequence of naturally occurring PDS, and all such variations are to be considered as being specifically disclosed.

Although the nucleotide sequences which encode PDS and/or its variants are preferably capable of hybridizing to the nucleotide sequence of naturally occurring PDS under stringent conditions, it may be advantageous to produce nucleotide sequences encoding PDS or its derivatives possessing a substantially different codon usage. Codons can be selected to increase the rate at which expression of the peptide occurs in a particular prokaryotic or eukaryotic expression host in accordance with the frequency with which particular codons are utilized by the host. Other reasons for substantially altering the nucleotide sequence encoding PDS and/or its derivatives without altering the encoded amino acid sequence include the production of RNA transcripts having more desirable properties, such as a greater half-life, than transcripts produced from the naturally occurring sequence.

Nucleotide sequences encoding PDS may be joined to a variety of other nucleotide sequences by means of well established recombinant DNA techniques (cf Sambrook J et al. supra). Useful nucleotide sequences for joining to pds include an assortment of cloning vectors, e.g., plasmids, cosmids, lambda phage derivatives, phagemids, and the like, that are well known in the art. Vectors of interest include expression vectors, replication vectors, probe generation vectors, sequencing vectors, and the like. In general, vectors of interest may contain an origin of replication functional in at least one organism, convenient restriction endonuclease sensitive sites, and selectable markers for the host cell.

Another aspect of the subject invention is to provide for pds-specific nucleic acid hybridization probes capable of hybridizing with naturally occurring nucleotide sequences encoding PDS. Such probes may also be used for the detection of similar serpin encoding sequences and should preferably contain at least 50% of the nucleotides from the conserved region or active site. The hybridization probes of the subject invention may be derived from the nucleotide sequences of the SEQ ID NO 1 or from genomic sequences including promoters, enhancer elements and/or possible introns of respective naturally occurring pds molecules. Hybridization probes may be labeled by a variety of reporter groups, including radionuclides such as 32P or 35S, or enzymatic labels such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like.

PCR as described U.S. Pat. Nos. 4,683,195; 4,800,195; and 4,965,188 provides additional uses for oligonucleotides based upon the nucleotide sequence which encodes PDS. Such probes used in PCR may be of recombinant origin, may be chemically synthesized, or a mixture of both and comprise a discrete nucleotide sequence for diagnostic use or a degenerate pool of possible sequences for identification of closely related genomic sequences.

Other means of producing specific hybridization probes for pds DNAs include the cloning of nucleic acid sequences encoding PDS or PDS derivatives into vectors for the production of mRNA probes. Such vectors are known in the art and are commercially available and may be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA polymerase as T7 or SP6 RNA polymerase and the appropriate radioactively labeled nucleotides.

It is now possible to produce a DNA sequence, or portions thereof, encoding PDS and their derivatives entirely by synthetic chemistry, after which the gene can be inserted into any of the many available DNA vectors using reagents, vectors and cells that are known in the art at the time of the filing of this application. Moreover, synthetic chemistry may be used to introduce mutations into the pds sequences or any portion thereof.

The nucleotide sequence can be used in an assay to detect inflammation or disease associated with abnormal levels of expression of PDS. The nucleotide sequence can be labeled by methods known in the art and added to a fluid or tissue sample from a patient under hybridizing conditions. After an incubation period, the sample is washed with a compatible fluid which optionally contains a dye (or other label requiring a developer) if the nucleotide has been labeled with an enzyme. After the compatible fluid is rinsed off, the dye is quantitated and compared with a standard. If the amount of dye is significantly elevated, the nucleotide sequence has hybridized with the sample, and the assay indicates the presence of inflammation and/or disease.

The nucleotide sequence for pancreas-derived serpin can be used to construct hybridization probes for mapping that gene. The nucleotide sequence provided herein may be mapped to a particular chromosome or to specific regions of that chromosome using well known genetic and/or chromosomal mapping techniques. These techniques include in situ hybridization, linkage analysis against known chromosomal markers, hybridization screening with libraries, flow-sorted chromosomal preparations, or artificial chromosome constructions YAC or P1 constructions. The technique of fluorescent in situ hybridization of chromosome spreads has been described, among other places, in Verma et al (1988) Human Chromosomes: A Manual of Basic Techniques, Pergamon Press, New York City.

Fluorescent in situ hybridization of chromosomal preparations and other physical chromosome mapping techniques may be correlated with additional genetic map data. Examples of genetic map data can be found in the 1994 Genome Issue of Science (265:1981f). Correlation between the location of pds on a physical chromosomal map and a specific disease (or predisposition to a specific disease) can help delimit the region of DNA associated with that genetic disease. The nucleotide sequence of the subject invention may be used to detect differences in gene sequence between normal and carrier or affected individuals.

Nucleotide sequences encoding PDS may be used to produce purified PDS using well known methods of recombinant DNA technology. Among the many publications that teach methods for the expression of genes after they have been isolated is Goeddel (1990) Gene Expression Technology, Methods and Enzymology, Vol 185, Academic Press, San Diego Calif. PDS may be expressed in a variety of host cells, either prokaryotic or eukaryotic. Host cells may be from the same species in which PDS nucleotide sequences are endogenous or from a different species. Advantages of producing PDS by recombinant DNA technology include obtaining adequate amounts of the protein for purification and the availability of simplified purification procedures.

Cells transformed with DNA encoding PDS may be cultured under conditions suitable for the expression of serpins and recovery of the protein from the cell culture. PDS produced by a recombinant cell may be secreted or may be contained intracellularly, depending on the pds sequence and the genetic construction used. In general, it is more convenient to prepare recombinant proteins in secreted form. Purification steps vary with the production process and the particular protein produced.

In addition to recombinant production, fragments of PDS may be produced by direct peptide synthesis using solid-phase techniques (cf Stewart et al (1969) Solid-Phase Peptide Synthesis, WH Freeman Co, San Francisco Calif.; Merrifield J (1963) J Am Chem Soc 85:2149-2154. In vitro protein synthesis may be performed using manual techniques or by automation. Automated synthesis may be achieved, for example, using Applied Biosystems 431A Peptide Synthesizer (Foster City, California Calif.) in accordance with the instructions provided by the manufacturer. Various fragments of PDS may be chemically synthesized separately and combined using chemical methods to produce the full length molecule.

PDS for antibody induction does not require biological activity; however, the protein must be immunogenic. Peptides used to induce specific antibodies may have an amino acid sequence consisting of at least five aa, preferably at least 10 amino acid. They should mimic a portion of the amino acid sequence of the protein and may contain the entire amino acid sequence of a small naturally occurring molecule such as PDS. Short stretches of PDS aa may be fused with those of another protein such as keyhole limpet hemocyanin and the chimeric molecule used for antibody production.

Antibodies specific for PDS may be produced by inoculation of an appropriate animal with the polypeptide or an antigenic fragment. An antibody is specific for PDS if it is produced against an epitope of the polypeptide and binds to at least part of the natural or recombinant protein. Antibody production includes not only the stimulation of an immune response by injection into animals, but also analogous steps in the production of synthetic antibodies or other specific-binding molecules such as the screening of recombinant immunoglobulin libraries (cf Orlandi R et al (1989) PNAS 86:3833-3837, or Huse W D et al (1989) Science 256:1275-1281) or the in vitro stimulation of lymphocyte populations. Current technology (Winter G and Milstein C (1991) Nature 349:293-299) provides for a number of highly specific binding reagents based on the principles of antibody formation. These techniques may be adapted to produce molecules specifically binding PDSs.

An additional embodiment of the subject invention is the use of PDS as a specific protease inhibitor to treat viral infections, endotoxin or exotoxin poisoning, ischemia, anoxia, direct trauma, and similar physiologic or pathologic problems of the pancreas.

PDS as a bioactive agent or composition may be administered in a suitable therapeutic dose determined by any of several methodologies including clinical studies on mammalian species to determine maximal tolerable dose and on normal human subjects to determine safe dose. Additionally, the bioactive agent may be complexed with a variety of well established compounds or compositions which enhance stability or pharmacological properties such as half-life. It is contemplated that the therapeutic, bioactive composition may be delivered by intravenous infusion into the bloodstream or any other effective means which could be used for treating problems involving excess expression and activity of proteases.

The examples below are provided to illustrate the subject invention. These examples are provided by way of illustration and are not included for the purpose of limiting the invention.

EXAMPLES

I Isolation of mRNA and Construction of CDNA Libraries

The pds sequence was identified among the sequences comprising a human pancreas library. The normal pancreas used for this library was obtained from the Keystone Skin Bank, International Institute for the Advancement of Medicine (Exton Pa.). Normal pancreas tissue from a 56 year old Caucasian male (Lot HDS330) was flash frozen, ground in a mortar and pestle, and lyzed immediately in buffer containing guanidinium isothiocyanate. Lysis was followed by several phenol chloroform extractions and ethanol precipitation. Poly A⁺ RNA was isolated using biotinylated oligo d(T) primer and streptavidin coupled to a paramagnetic particle (Promega Corp, Madison Wis.) and sent to Stratagene (La Jolla Calif.).

Stratagene prepared the cDNA library using oligo d(T) priming. Synthetic adapter oligonucleotides were ligated onto the cDNA molecules enabling them to be inserted into the UNI-ZAP™ vector system (Stratagene). This allowed high efficiency unidirectional (sense orientation) lambda library construction and the convenience of a plasmid system with blue/white color selection to detect clones with cDNA insertions.

The quality of the cDNA library was screened using DNA probes, and then, the pBluescript® phagemid (Stratagene) was excised. This phagemid allows the use of a plasmid system for easy insert characterization, sequencing, site-directed mutagenesis, the creation of unidirectional deletions and expression of fusion polypeptides. Subsequently, the custom-constructed library phage particles were infected into E. coli host strain XL1-BLUE® (Stratagene). The high transformation efficiency of this bacterial strain increases the probability that the cDNA library will contain rare, under-represented clones. Alternative unidirectional vectors might include, but are not limited to, pcDNAI (Invitrogen, San Diego Calif.) and pSHlox-1 (Novagen, Madison Wis.).

II Isolation of CDNA Clones

The phagemid forms of individual cDNA clones were obtained by the in vivo excision process, in which XL1-BLUE was coinfected with an f1 helper phage. Proteins derived from both lambda phage and f1 helper phage initiated new DNA synthesis from defined sequences on the lambda target DNA and create a smaller, single-stranded circular phagemid DNA molecule that includes all DNA sequences of the pBluescript plasmid and the cDNA insert. The phagemid DNA was released from the cells and purified, then used to reinfect fresh bacterial host cells (SOLR, Stratagene Inc), where the double-stranded phagemid DNA was produced. Because the phagemid carries the gene for β-lactamase, the newly transformed bacteria were selected on medium containing ampicillin.

Phagemid DNA was purified using the QIAWELL-8 Plasmid Purification System from QIAGEN® DNA Purification System (QIAGEN Inc, Chatsworth Calif.). This technique provides a rapid and reliable high-throughput method for lysing the bacterial cells and isolating highly purified phagemid DNA. The DNA eluted from the purification resin was suitable for DNA sequencing and other analytical manipulations.

An alternate method of purifying phagemid has recently become available. It utilizes the Miniprep Kit (Catalog No. 77468, available from Advanced Genetic Technologies Corporation, Gaithersburg Md.). This kit is in the 96-well format and provides enough reagents for 960 purifications. Each kit is provided with a recommended protocol, which has been employed except for the following changes. First, the 96 wells are each filled with only 1 ml of sterile terrific broth with carbenicillin at 25 mg/L and glycerol at 0.4%. After the wells are inoculated, the bacteria are cultured for 24 hours and lysed with 60 μl of lysis buffer. A centrifugation step (2900 rpm for 5 minutes) is performed before the contents of the block are added to the primary filter plate. The optional step of adding isopropanol to TRIS buffer is not routinely performed. After the last step in the protocol, samples are transferred to a Beckman 96-well block for storage.

III Sequencing of CDNA Clones

The cDNA inserts from random isolates of the pancreas library were sequenced in part. Methods for DNA sequencing are well known in the art. Conventional enzymatic methods employed DNA polymerase Klenow fragment, SEQUENASE® (US Biochemical Corp, Cleveland, Ohio) or Taq polymerase to extend DNA chains from an oligonucleotide primer annealed to the DNA template of interest. Methods have been developed for the use of both single- and double-stranded templates. The chain termination reaction products were electrophoresed on urea-acrylamide gels and detected either by autoradiography (for radionuclide-labeled precursors) or by fluorescence (for fluorescent-labeled precursors). Recent improvements in mechanized reaction preparation, sequencing and analysis using the fluorescent detection method have permitted expansion in the number of sequences that can be determined per day (using machines such as the Catalyst 800 and the Applied Biosystems 377 or 373 DNA sequencer).

IV Homology Searching of CDNA Clones and Deduced Proteins

Each sequence so obtained was compared to sequences in GenBank using a search algorithm developed by Applied Biosystems Inc. and incorporated into the INHERIT™ 670 Sequence Analysis System. In this algorithm, Pattern Specification Language (developed by TRW Inc.) was used to determine regions of homology. The three parameters that determine how the sequence comparisons run were window size, window offset, and error tolerance. Using a combination of these three parameters, the DNA database was searched for sequences containing regions of homology to the query sequence, and the appropriate sequences were scored with an initial value. Subsequently, these homologous regions were examined using dot matrix homology plots to distinguish regions of homology from chance matches. Smith-Waterman alignments of the protein sequence were used to display the results of the homology search.

Peptide and protein sequence homologies were ascertained using the INHERIT 670 Sequence Analysis System in a way similar to that used in DNA sequence homologies. Pattern Specification Language and parameter windows were used to search protein databases for sequences containing regions of homology which were scored with an initial value. Dot-matrix homology plots were examined to distinguish regions of significant homology from chance matches.

Alternatively, BLAST, which stands for Basic Local Alignment Search Tool, is used to search for local sequence alignments (Altschul SF (1993) J Mol Evol 36:290-300; Altschul, S F et al (1990) J Mol Biol 215:403-10). BLAST produces alignments of both nucleotide and amino acid sequences to determine sequence similarity. Because of the local nature of the alignments, BLAST is especially useful in determining exact matches or in identifying homologues. Although it is ideal for matches which do not contain gaps, it is inappropriate for performing motif-style searching. The fundamental unit of BLAST algorithm output is the high-scoring segment pair (HSP).

An HSP consists of two sequence fragments of arbitrary but equal lengths whose alignment is locally maximal and for which the alignment score meets or exceeds a threshold or cutoff score set by the user. The BLAST approach is to look for HSPs between a query sequence and a database sequence, to evaluate the statistical significance of any matches found, and to report only those matches which satisfy the user-selected threshold of significance. The parameter E establishes the statistically significant threshold for reporting database sequence matches. E is interpreted as the upper bound of the expected frequency of chance occurrence of an HSP (or set of HSPs) within the context of the entire database search. Any database sequence whose match satisfies E is reported in the program output.

The nucleotide sequence for the entire coding region of the pancreas-derived serpin, PDS, claimed in this invention is shown in FIGS. 1A-1B.

V Identification and Full Length Sequencing of the Genes

From all of the randomly picked and sequenced clones of the pancreas library, the PDS sequence was homologous to but clearly different from any known serpin. The complete nucleotide sequence was obtained using GENE AMP XL PCR™ (Perkin Elmer, Foster City Calif.) and oligonucleotides designed from Incyte 222689 to extend the serpin sequence to its full length.

The sequence for the full length pancreas-derived serpin was translated, and the in-frame translation is shown in FIGS. 1A-1B. When all three possible predicted translations of the sequence were searched against protein databases such as SwissProt and PIR, no exact matches were found to the possible translations of PDS. FIGS. 2A, 2B and 2C show the comparison of the PDS amino acid sequence with GenBank human and rat serpins. The substantial regions of homology among these molecules begin at M₂₁₈. The rat serpin with the closest homology and from which the new serpin was identified is missing the first 217 residues. Other diagnostic residues are: 1) P15 which is G₂₄₇, 2) P1 which is M₃₆₂, and 3) P1' is S₃₆₃. It should be noted that PDS has an extra amino acid between P1 and P15. Further analysis of 25 this molecule suggests that it is specific for chymotrypsin-like proteases which cleave their target proteins after hydrophobic residues.

VI Antisense analysis

Knowledge of the cDNA sequence of the new serpin gene will enable its use in antisense technology in the investigation of gene function. Oligonucleotides, genomic or cDNA fragments comprising the antisense strand of PDS can be used either in vitro or in vivo to inhibit expression of the protein. Such technology is now well known in the art, and probes can be designed at various locations along the nucleotide sequence. By treatment of cells or whole test animals with such antisense sequences, the gene of interest can effectively be turned off. Frequently, the function of the gene can be ascertained by observing behavior at the cellular, tissue or organismal level (e.g. lethality, loss of differentiated function, changes in morphology, etc).

In addition to using sequences constructed to interrupt transcription of the open reading frame, modifications of gene expression can be obtained by designing antisense sequences to intron regions, promoter/enhancer elements, or even to trans-acting regulatory genes. Similarly, inhibition can be achieved using Hogeboom base-pairing methodology, also known as "triple helix" base pairing.

VII Expression of PDS

Expression of PDS may be accomplished by subcloning the cDNAs into appropriate expression vectors and transfecting the vectors into appropriate expression hosts. In this particular case, the cloning vector used in the generation of the full length clone also provides for direct expression of the included pds sequence in E. coli. Upstream of the cloning site, this vector contains a promoter for β-galactosidase, followed by sequence containing the amino-terminal Met and the subsequent 7 residues of β-galactosidase. Immediately following these eight residues is an engineered bacteriophage promoter useful for artificial priming and transcription and a number of unique restriction sites, including Eco RI, for cloning.

Induction of the isolated, transfected bacterial strain with IPTG using standard methods will produce a fusion protein corresponding to the first seven residues of β-galactosidase, about 15 residues of "linker", and the peptide encoded within the cDNA. Since cDNA clone inserts are generated by an essentially random process, there is one chance in three that the included cDNA will lie in the correct frame for proper translation. If the cDNA is not in the proper reading frame, it can be obtained by deletion or insertion of the appropriate number of bases by well known methods including in vitro mutagenesis, digestion with exonuclease III or mung bean nuclease, or oligonucleotide linker inclusion.

The pds cDNA can be shuttled into other vectors known to be useful for expression of protein in specific hosts. Oligonucleotide amplimers containing cloning sites as well as a segment of DNA sufficient to hybridize to stretches at both ends of the target cDNA (25 bases) can be synthesized chemically by standard methods. These primers can then used to amplify the desired gene segments by PCR. The resulting new gene segments can be digested with appropriate restriction enzymes under standard conditions and isolated by gel electrophoresis. Alternately, similar gene segments can be produced by digestion of the CDNA with appropriate restriction enzymes and filling in the missing gene segments with chemically synthesized oligonucleotides. Segments of the coding sequence from more than one gene can be ligated together and cloned in appropriate vectors to optimize expression of recombinant sequence.

Suitable expression hosts for such chimeric molecules include but are not limited to mammalian cells such as Chinese Hamster Ovary (CHO) and human 293 cells, insect cells such as Sf9 cells, yeast cells such as Saccharomyces cerevisiae, and bacteria such as E. coli. For each of these cell systems, a useful expression vector may also include an origin of replication to allow propagation in bacteria and a selectable marker such as the β-lactamase antibiotic resistance gene to allow selection in bacteria. In addition, the vectors may include a second selectable marker such as the neomycin phosphotransferase gene to allow selection in transfected eukaryotic host cells. Vectors for use in eukaryotic expression hosts may require RNA processing elements such as 3' polyadenylation sequences if such are not part of the cDNA of interest.

Additionally, the vector may contain promoters or enhancers which increase gene expression. Such promoters are host specific and include MMTV, SV40, or metallothionine promoters for CHO cells; trp, lac, tac or T7 promoters for bacterial hosts, or alpha factor, alcohol oxidase or PGH promoters for yeast. Transcription enhancers, such as the rous sarcoma virus (RSV) enhancer, may be used in mammalian host cells. Once homogeneous cultures of recombinant cells are obtained through standard culture methods, large quantities of recombinantly produced PDS can be recovered from the conditioned medium and analyzed using chromatographic methods known in the art.

VIII Isolation of Recombinant PDS

PDS may be expressed as a chimeric protein with one or more additional polypeptide domains added to facilitate protein purification. Such purification facilitating domains include, but are not limited to, metal chelating peptides such as histidine-tryptophan modules that allow purification on immobilized metals, protein A domains that allow purification on immobilized immunoglobulin, and the domain utilized in the FLAGS extension/affinity purification system (Immunex Corp., Seattle Wash.). The inclusion of a cleavable linker sequence such as Factor XA or enterokinase(Invitrogen, San Diego Calif.) between the purification domain and the pds sequence may be useful to facilitate expression of PDS.

IX Production of PDS Specific Antibodies

Two approaches are utilized to raise antibodies to PDS, and each approach is useful for generating either polyclonal or monoclonal antibodies. In one approach, denatured protein from the reverse phase HPLC separation is obtained in quantities up to 75 mg. This denatured protein can be used to immunize mice or rabbits using standard protocols; about 100 micrograms are adequate for immunization of a mouse, while up to 1 mg might be used to immunize a rabbit. For identifying mouse hybridomas, the denatured protein can be radioiodinated and used to screen potential murine B-cell hybridomas for those which produce antibody. This procedure requires only small quantities of protein, such that 20 mg would be sufficient for labeling and screening of several thousand clones.

In the second approach, the amino acid sequence of PDS, as deduced from translation of the cDNA, is analyzed to determine regions of high immunogenicity. Oligopeptides comprising appropriate hydrophilic regions, are synthesized and used in suitable immunization protocols to raise antibodies. Analysis to select appropriate epitopes is described by Ausubel FM et al (supra). The optimal amino acid sequences for immunization are usually at the C-terminus, the N-terminus and those intervening, hydrophilic regions of the polypeptide which are likely to be exposed to the external environment when the protein is in its natural conformation.

Typically, selected peptides, about 15 residues in length, are synthesized using an Applied Biosystems Peptide Synthesizer Model 431A using fmoc-chemistry and coupled to keyhole limpet hemocyanin (KLH, Sigma) by reaction with M-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS; cf. Ausubel FM et al, supra). If necessary, a cysteine may be introduced at the N-terminus of the peptide to permit coupling to KLH. Rabbits are immunized with the peptide-KLH complex in complete Freund's adjuvant. The resulting antisera are tested for antipeptide activity by binding the peptide to plastic, blocking with 1% BSA, reacting with antisera, washing and reacting with labeled (radioactive or fluorescent), affinity purified, specific goat anti-rabbit IgG.

Hybridomas may also be prepared and screened using standard techniques. Hybridomas of interest are detected by screening with labeled PDS to identify those fusions producing the monoclonal antibody with the desired specificity. In a typical protocol, wells of plates (FAST; Becton-Dickinson, Palo Alto, Calif.) are coated with affinity purified, specific rabbit-anti-mouse (or suitable anti-species Ig) antibodies at 10 mg/ml. The coated wells are blocked with 1% BSA, washed and exposed to supernatants from hybridomas. After incubation the wells are exposed to labeled PDS, 1 mg/ml. Clones producing antibodies will bind a quantity of labeled PDS which is detectable above background. Such clones are expanded and subjected to 2 cycles of cloning at limiting dilution (1 cell/3 wells). Cloned hybridomas are injected into pristane mice to produce ascites, and monoclonal antibody is purified from mouse ascitic fluid by affinity chromatography on Protein A. Monoclonal antibodies with affinities of at least 10⁸ M⁻¹, preferably 10⁹ to 10¹⁰ or stronger, will typically be made by standard procedures as described in Harlow and Lane (1988) Antibodies: A Laboratory Manual. Cold Spring Harbor Laboratory, Cold Spring Harbor N.Y.; and in Goding (1986) Monoclonal Antibodies: Principles and Practice, Academic Press, New York City, both incorporated herein by reference.

X Diagnostic Test Using PDS Specific Antibodies

Particular PDS antibodies are useful for the diagnosis of prepathologic conditions, and chronic or acute diseases which are characterized by differences in the amount or distribution of PDS. To date, PDS has only been expressed in the pancreas library and is thus specific for the normal, abnormal or pathological function of the pancreas.

Diagnostic tests for PDS include methods utilizing the antibody and a label to detect PDS in human body fluids, tissues or extracts of such tissues. The polypeptides and antibodies of the present invention may be used with or without modification. Frequently, the polypeptides and antibodies will be labeled by joining them, either covalently or noncovalently, with a substance which provides for a detectable signal. A wide variety of labels and conjugation techniques are known and have been reported extensively in both the scientific and patent literature. Suitable labels include radionuclides, enzymes, substrates, cofactors, inhibitors, fluorescent agents, chemiluminescent agents, magnetic particles and the like. Patents teaching the use of such labels include U.S. Pat. Nos. 3,817,837; 3,850,752; 3,939,350; 3,996,345; 4,277,437; 4,275,149; and 4,366,241. Also, recombinant immunoglobulins may be produced as shown in U.S. Pat. No. 4,816,567, incorporated herein by reference.

A variety of protocols for measuring soluble or membrane-bound PDS, using either polyclonal or monoclonal antibodies specific for the respective protein are known in the art. Examples include enzyme-linked immunosorbent assay (ELISA), radioimmunoassay (RIA) and fluorescent activated cell sorting (FACS). A two-site monoclonal-based immunoassay utilizing monoclonal antibodies reactive to two non-interfering epitopes on PDS is preferred, but a competitive binding assay may be employed. These assays are described, among other places, in Maddox, Del. et al (1983, J Exp Med 158:1211).

Xl Purification of Native PDS Using Specific Antibodies

Native or recombinant PDS can be purified by immunoaffinity chromatography using antibodies specific for PDS. In general, an immunoaffinity column is constructed by covalently coupling the anti-PDS antibody to an activated chromatographic resin.

Polyclonal immunoglobulins are prepared from immune sera either by precipitation with ammonium sulfate or by purification on immobilized Protein A (Pharmacia LKB Biotechnology, Piscataway, N.J.). Likewise, monoclonal antibodies are prepared from mouse ascites fluid by ammonium sulfate precipitation or chromatography on immobilized Protein A. Partially purified immunoglobulin is covalently attached to a chromatographic resin such as CnBr-activated Sepharose (Pharmacia LKB Biotechnology). The antibody is coupled to the resin, the resin is blocked, and the derivative resin is washed according to the manufacturer's instructions.

Such immunoaffinity columns are utilized in the purification of PDS by preparing a fraction from cells containing PDS in a soluble form. This preparation is derived by solubilization of the whole cell or of a subcellular fraction obtained via differential centrifugation by the addition of detergent or by other methods well known in the art. Alternatively, soluble PDS containing a signal sequence may be secreted in useful quantity into the medium in which the cells are grown.

A soluble PDS-containing preparation is passed over the immunoaffinity column, and the column is washed under conditions that allow the preferential absorbance of serpin (eg, high ionic strength buffers in the presence of detergent). Then, the column is eluted under conditions that disrupt antibody/PDS binding (e.g., a buffer of pH 2-3 or a high concentration of a chaotrope such as urea or thiocyanate ion), and PDS is collected.

XII PDS Activity

The activity of purified or expressed PDS may be tested by mixing a known quantity of the enzyme with a potential substrate protease such as chymostrypsin and a purified protein which chymostrypsin usually cleaves. The ability of a given amount of PDS to inhibit chymotrypsin can be assayed by FPLC of the protein fragments produced under a given set of conditions in a specific period of time.

Alternatively, running a sample of the reaction materials on a nondenaturing gel shows the protease inhibitor complex, protease, inhibitor, protein substrate and protein fragments as different sized peptides.

XIII Rational Drug Design

The goal of rational drug design is to produce structural analogs of biologically active polypeptides of interest or of small molecules with which they interact, eg, agonists, antagonists, etc. Any of these examples can be used to fashion drugs which are more active or stable forms of the polypeptide or which enhance or interfere with the function of a polypeptide in vivo (of Hodgson J (1991) Bio/Technology 9:19-21, incorporated herein by reference).

In one approach, the three-dimensional structure of a protein of interest, or of a protein-inhibitor complex, is determined by x-ray crystallography, by computer modeling or, most typically, by a combination of the two approaches. Both the shape and charges of the polypeptide must be ascertained to elucidate the structure and to determine active site(s) of the molecule. Less often, useful information regarding the structure of a polypeptide may be gained by modeling based on the structure of homologous proteins. In both cases, relevant structural information is used to design analogous serpin-like molecules or to identify efficient inhibitors. Useful examples of rational drug design may include molecules which have improved activity or stability as shown by Braxton S and Wells J A (1992 Biochemistry 31:7796-7801) or which act as inhibitors, agonists, or antagonists of native peptides as shown by Athauda S B et al (1993 J Biochem 113:742-746), incorporated herein by reference.

It is also possible to isolate a target-specific antibody, selected by functional assay, as described above, and then to solve its crystal structure. This approach, in principle, yields a pharmacore upon which subsequent drug design can be based. It is possible to bypass protein crystallography altogether by generating anti-idiotypic antibodies (anti-ids) to a functional, pharmacologically active antibody. As a mirror image of a mirror image, the binding site of the anti-ids would be expected to be an analog of the original receptor. The anti-id could then be used to identify and isolate peptides from banks of chemically or biologically produced peptides. The isolated peptides would then act as the pharmacore.

By virtue of the present invention, sufficient amount of polypeptide may be made available to perform such analytical studies as X-ray crystallography. In addition, knowledge of the PDS amino acid sequence provided herein will provide guidance to those employing computer modeling techniques in place of or in addition to x-ray crystallography.

XIV Use and Administration of PDS

Since PDS is an inhibitor, it may be used to treat excessive protease production.

PDS will be formulated in a nontoxic, inert, pharmaceutically acceptable aqueous carrier medium (PDS treatment, PDST) preferably at a pH of about 5 to 8, more preferably 6 to 8, although the pH may vary according to the characteristics of the formulation and its administration. Characteristics such as solubility of the molecule, half-life and antigenicity/immuno-genicity will aid in defining an effective carrier. Native human proteins are preferred as PDST, but recombinant, organic or synthetic molecules resulting from drug design may be equally effective in particular situations.

PDSTs may be delivered by known routes of administration including but not limited to topical creams and gels; transmucosal spray and aerosol, transdermal patch and bandage; injectable, intravenous and lavage formulations; and orally administered liquids and pills, particularly formulated to resist stomach acid and enzymes. The particular formulation, exact dosage, and route of administration will be determined by the attending physician and will vary according to each specific situation.

Such determinations are made by considering multiple variables such as the condition to be treated, the PDST to be administered, and the pharmacokinetic profile of the particular PDST. Additional factors which may be taken into account include disease state (e.g. severity) of the patient, age, weight, gender, diet, time of administration, drug combination, reaction sensitivities, and tolerance/response to therapy. Long acting PDST formulations might be administered every 3 to 4 days, every week, or once every two weeks depending on half-life and clearance rate of the particular PDST.

Normal dosage amounts may vary from 0.1 to 100,000 micrograms, up to a total dose of about 1 g, depending upon the route of administration. Guidance as to particular dosages and methods of delivery is provided in the literature; see U.S. Pat. Nos. 4,657,760; 5,206,344; or 5,225,212. It is anticipated that different formulations will be effective for different uses of PDST and that administration targeting a tissue or organ may necessitate delivery in a specific manner.

It is contemplated that pancreatitis or other conditions or diseases of the pancreas caused by viral infections, endotoxin or exotoxin poisoning, ischemia, anoxia, and direct trauma which may cause the overexpression of proteases may be treated with PDSTs.

All publications and patents mentioned in the above specification are herein incorporated by reference. The foregoing written specification is considered to be sufficient to enable one skilled in the art to practice the invention. Indeed, various modifications of the above described modes for carrying out the invention which are readily apparent to those skilled in the field of molecular biology or related fields are intended to be within the scope of the following claims.

    __________________________________________________________________________     SEQUENCE LISTING                                                               (1) GENERAL INFORMATION:                                                       (iii) NUMBER OF SEQUENCES: 2                                                   (2) INFORMATION FOR SEQ ID NO:1:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 1221 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (vii) IMMEDIATE SOURCE:                                                        (A) LIBRARY: Pancreas                                                          (B) CLONE: 222689                                                              (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                              (B) LOCATION: 1..1221                                                          (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:                                        ATGGACACAATCTTCGNGTGGAGTCTTCTATTGCTGTTTNGGGGNAGTCAAGCCTCAAGA60                 TGCTCAGCTCAAAAAAATACCGAATTTGGAGTGGATCTTTATCAAGAGGTTTCCTTGTCT120                CATAAGGACAACATTATTTTTNCACCCCTTGGAATANCTTTGGNTCTCGAGATGGNACAA180                CTGGGAGCCAAAGGAAAAGCACAGCAGNAGNTAAGACAAACTTTACAACAACAGGAANCC240                TCAGCTGGGGAAGAATTTCTTTGTNCTGAAGTCATTTTCTCTCTGCCATCTCAGAGAAAA300                AACAAGAATTTACATTTAATCTTGCCAATGCCCTCTACCTNTCAAGAAGGATTCACTGTG360                AAAGAACAGTATCTCCATGGCAACAAGGAATNTTTTCAGAGTGCTATAAAACTGGTGGAT420                TTTCAAGATGCAAAGGCTTGTGCAGGGATGATAAGTACCTGGGTAGAAAGAAAAACAGAT480                GGAAAAATTAAAGACATGTTTTCAGGGGAAGAATTTGGCCCTCTGACTCGGCTTGTCCTG540                GTGAATGCTATTTATTTCAAAGGAGATTGGAAACAGAAATTCAGAAAAGAGGACACACAG600                CTGATAAATTTTACTAAGAAAAATGGTTCAACTGTCAAAATTCCAATGATGAAGGCTCTT660                CTGAGAACAAAATATGGTTATTTTTCTGAATCTTCCCTGAACTACCAAGTTTTAGAATTG720                TCTTACAAAGGTGATGAATTTAGCTTAATTATCATACTTCCTGCAGAAGGTATGGATATA780                GAAGAAGTGGAAAAACTAATTACTGCTCAACAAATCCTAAAATGGCTCTCTGAGATGCAA840                GAAGAGGAAGTAGAAATAAGCCTCCCTAGATTTAAAGTAGAACAAAAAGTAGACTTCAAA900                GACGTTTTGTTTTCTTTGAACATAACCGAGATATTTAGTGGTGGCTGCGACCTTTCTGGA960                ATAACAGATTCTTCTGAAGTGTATGTTTCCCAAGTGACGCAAAAAGTTTTCTTTGAGATA1020               AATGAAGATGGTAGTGAAGCTGCAACATCAACTGGCATACACATCCCTGTGATCATGAGT1080               CTGGCTCAAAGCCAATTTATAGCAAATCATCCATTTCTGTTTATTATGAAGCATAACCCA1140               ACAGAATCAATTCTGTTTATGGGAAGAGTGACAAATCCTGACACCCAGGAGATAAAAGGA1200               AGAGATTTAGATTCACTGTGA1221                                                      (2) INFORMATION FOR SEQ ID NO:2:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 406 amino acids                                                    (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:                                        MetAspThrIlePheXaaTrpSerLeuLeuLeuLeuPheXaaGlySer                               151015                                                                         GlnAlaSerArgCysSerAlaGlnLysAsnThrGluPheGlyValAsp                               202530                                                                         LeuTyrGlnGluValSerLeuSerHisLysAspAsnIleIlePheXaa                               354045                                                                         ProLeuGlyIleXaaLeuXaaLeuGluMetXaaGlnLeuGlyAlaLys                               505560                                                                         GlyLysAlaGlnGlnXaaXaaArgGlnThrLeuGlnGlnGlnGluXaa                               65707580                                                                       SerAlaGlyGluGluPheLeuCysXaaGluValIlePheSerLeuPro                               859095                                                                         SerGlnArgLysAsnLysAsnLeuHisLeuIleLeuProMetProSer                               100105110                                                                      ThrXaaGlnGluGlyPheThrValLysGluGlnTyrLeuHisGlyAsn                               115120125                                                                      LysGluXaaPheGlnSerAlaIleLysLeuValAspPheGlnAspAla                               130135140                                                                      LysAlaCysAlaGlyMetIleSerThrTrpValGluArgLysThrAsp                               145150155160                                                                   GlyLysIleLysAspMetPheSerGlyGluGluPheGlyProLeuThr                               165170175                                                                      ArgLeuValLeuValAsnAlaIleTyrPheLysGlyAspTrpLysGln                               180185190                                                                      LysPheArgLysGluAspThrGlnLeuIleAsnPheThrLysLysAsn                               195200205                                                                      GlySerThrValLysIleProMetMetLysAlaLeuLeuArgThrLys                               210215220                                                                      TyrGlyTyrPheSerGluSerSerLeuAsnTyrGlnValLeuGluLeu                               225230235240                                                                   SerTyrLysGlyAspGluPheSerLeuIleIleIleLeuProAlaGlu                               245250255                                                                      GlyMetAspIleGluGluValGluLysLeuIleThrAlaGlnGlnIle                               260265270                                                                      LeuLysTrpLeuSerGluMetGlnGluGluGluValGluIleSerLeu                               275280285                                                                      ProArgPheLysValGluGlnLysValAspPheLysAspValLeuPhe                               290295300                                                                      SerLeuAsnIleThrGluIlePheSerGlyGlyCysAspLeuSerGly                               305310315320                                                                   IleThrAspSerSerGluValTyrValSerGlnValThrGlnLysVal                               325330335                                                                      PhePheGluIleAsnGluAspGlySerGluAlaAlaThrSerThrGly                               340345350                                                                      IleHisIleProValIleMetSerLeuAlaGlnSerGlnPheIleAla                               355360365                                                                      AsnHisProPheLeuPheIleMetLysHisAsnProThrGluSerIle                               370375380                                                                      LeuPheMetGlyArgValThrAsnProAspThrGlnGluIleLysGly                               385390395400                                                                   ArgAspLeuAspSerLeu                                                             405                                                                            __________________________________________________________________________ 

We claim:
 1. A purified polynucleotide consisting of a polynucleotide sequence encoding the polypeptide consisting of the sequence of SEQ ID NO.2, or its complement.
 2. The polynucleotide of claim 1 having the sequence of SEQ ID NO:
 1. 3. A purified polynucleotide capable of specifically hybridizing to the polynucleotide consisting of the polynucleotide sequence of SEQ ID NO:1, or its complement.
 4. An expression vector comprising the purified polynucleotide of claim
 1. 5. A host cell comprising the expression vector of claim
 4. 6. A purified oligonucleotide comprising between 10 and 50 contiguous nucleotides from the polynucleotide of claim
 2. 7. The oligonucleotide of claim 6 comprising between 15 and 30 nucleotides.
 8. A method for producing the polypeptide consisting of SEQ ID NO: 2, said method comprising:a) culturing the host cells of claim 5 under conditions suitable for the expression of the polypeptide, and b) recovering said polypeptide from the cell culture. 